Convolutional Neural Networks (CNNs) have found widespread applications in artificial intelligence fields such as computer vision and edge computing. However, as input data dimensionality and convolutional model depth continue to increase, deploying CNNs on edge and embedded devices faces significant challenges, including high computational demands, excessive hardware resource consumption, and prolonged computation times. In contrast, the Decomposable Winograd Method (DWM), which decomposes large-size or large-stride kernels into smaller kernels, provides a more efficient solution for inference acceleration in resource-constrained environments. This work proposes an approach employing a layer-to-layer unified input transformation based on the Decomposable Winograd Method, which reduces computational complexity in the feature transformation unit through system-level parallel pipelining and operation reuse. Additionally, we introduce a reconfigurable, column-indexed Winograd computation unit design to minimize hardware resource consumption. We also design flexible data access patterns to support efficient computation. Finally, we propose a preprocessing shift network system that enables low-latency data access and dynamic selection of the Winograd computation unit. Experimental evaluations on the VGG-16 and ResNet-18 networks demonstrate that our accelerator, deployed on the Xilinx XC7Z045 platform, achieves an average throughput of 683.26 GOPS. Compared to existing approaches, the design improves DSP efficiency (GOPS/DSP) by 5.8×.
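For background on the arithmetic savings the abstract refers to, the following is a minimal sketch (not the authors' accelerator design) of the standard Winograd minimal filtering algorithm F(2,3), which computes two outputs of a 1-D convolution with a 3-tap filter using 4 multiplications instead of 6; the DWM builds on such small-kernel transforms by decomposing larger or strided kernels into kernels of this size. The matrices B^T, G, A^T below are the commonly published F(2,3) transform matrices, used here purely for illustration.

```python
import numpy as np

# Standard F(2,3) Winograd transform matrices (illustrative, not the paper's hardware mapping).
B_T = np.array([[1, 0, -1, 0],
                [0, 1,  1, 0],
                [0, -1, 1, 0],
                [0, 1,  0, -1]], dtype=float)   # input transform
G = np.array([[1.0, 0.0, 0.0],
              [0.5, 0.5, 0.5],
              [0.5, -0.5, 0.5],
              [0.0, 0.0, 1.0]])                  # filter transform
A_T = np.array([[1, 1,  1,  0],
                [0, 1, -1, -1]], dtype=float)    # output transform

def winograd_f23(d, g):
    """Compute 2 outputs of a valid 1-D convolution of a length-4 input tile d
    with a 3-tap filter g, using 4 elementwise multiplications."""
    U = G @ g            # transformed filter (reusable across input tiles)
    V = B_T @ d          # transformed input tile
    M = U * V            # 4 elementwise multiplications
    return A_T @ M       # inverse transform -> 2 outputs

# Quick check against direct convolution (correlation form).
d = np.array([1.0, 2.0, 3.0, 4.0])
g = np.array([0.5, -1.0, 2.0])
direct = np.array([d[0]*g[0] + d[1]*g[1] + d[2]*g[2],
                   d[1]*g[0] + d[2]*g[1] + d[3]*g[2]])
assert np.allclose(winograd_f23(d, g), direct)
```

In an accelerator setting, the filter transform G g is typically computed once offline, so the per-tile cost is dominated by the 4 multiplications plus cheap additions in the input and output transforms; this is the kind of reduction the feature transformation unit described in the abstract exploits at the system level.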